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Vh . 

Mh. Abstract. Separation is a classical problem asking whether, given two 

■^^ ' sets belonging to some class, it is possible to separate them by a set from 

a smaller class. We discuss the separation problem for regular languages. 

We give a Ptime algorithm to check whether two given regular languages 

are separable by a piecewise testable language, that is, whether a SX'i(<) 

sentence can witness that the languages are disjoint. The proof refines an 

l_^ ' algebraic argument from Almeida and the third author. When separation 

pL^ I is possible, we also express a separator by saturating one of the original 

jyi^ ■ languages by a suitable congruence. Following the same line, we show that 

^ ' one can as well decide whether two regular languages can be separated 

by an unambiguous language, albeit with a higher complexity. 

> . 

T^ . 1 Introduction 

m . 

t^^ . Separation is a classical notion in mathematics and computer science. In general, 

^^ ' one says that two structures Li,L2 from a class C are separable by a set L if 

■^ . Li C L and L2 D L = . In this case, L is called a separator. 

>— ^ ' In separation problems, the separator L is required to belong to a given 

subclass Sep of 6. The problem asks whether two disjoint elements £1,-^2 of C 
can always be separated by an element of the subclass Sep. 

In the case that disjoint elements of C cannot always be separated by an 
element of Sep, several natural questions arise: 
5— i ■ (1) given elements Li,L2 in C, can we decide whether a separator exists in Sep? 

(2) if so, what is the complexity of this decision problem? 

(3) can we, in addition, compute a separator, and what is the complexity? 
In this context, it is known for example that separation of two context-free 
languages by a regular one is undecidable [Hi- 
Separating regular languages. In this paper, we look at separation problems 
for the class C of regular languages. These separation problems generalize the 
task of finding decidable characterizations for subclasses Sep of C which are 
closed under complement: a separation algorithm for a subclass Sep entails an 
algorithm for deciding membership in Sep (i.e., membership reduces to separa- 
bility). Indeed, membership in Sep can be checked by testing whether the input 
language is Sep-separable from its complement. 
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While finding a decidable characterization for Sep already requires a deep 
understanding of the subclass, the search for separation algorithms is intrinsically 
more difficult. Indeed, powerful tools are already available to decide membership 
in Sep: one normally makes use of a recognizing device of the input language, 
viz. its syntactic monoid. A famous result along these lines is Schiitzenberger's 
Theorem ,15: , which states that a language is definable in first-order logic if and 
only if its syntactic monoid is aperiodic, which is easily decidable. 

Now for a separation algorithm, the question is whether the input languages 
are sufficiently different, from the point of view of the subclass Sep, to allow this 
to be witnessed by an element of Sep. Note that we cannot use standard methods 
on the recognizing devices, as was the case for the membership problem. We now 
have to decide whether there exists a recognition device of the given type that 
separates the input: we do not have it in hand, nor its syntactic monoid. An 
even harder question then is to actually construct the so-called separator in Sep. 

Contributions. In this paper, we study this problem for two subclasses of the 
regular languages: piecewise testable languages and unambiguous languages. 

Piecewise testable languages are languages that can be described by the pres- 
ence or absence of scattered subwords up to a certain size within the words. 
Equivalently, these are the languages definable using 23Z'i(<) formulas, that is, 
first-order logic formulas that are boolean combinations of i7i(<) formulas. A 
IJi{<) formula is a first-order formula with a quantifier prefix 3*. A well-known 
result about piecewise testable languages is Simon's Theorem [17] that states 
that a regular language is piecewise testable if and only if its syntactic monoid is 
0-trivial. This property yields a decision procedure to check whether a language 
is piecewise testable. Stern has refined this procedure into a polynomial time 
algorithm |19| , of which the complexity has been improved by Trahtman |22j . 

The second class that we consider is the class of unambiguous languages, i.e., 
languages defined by unambiguous products. This class has been given many 
equivalent characterizations [20]. For example, these are the FO^(<)-definable 
languages, i.e., languages that can be defined in first-order logic using only two 
variables. Equivalently, this is the class ^2(<) of languages that are definable by 
a first-order formula with a quantifier prefix 3*V* and simultaneously by a first- 
order formula with a quantifier prefix V*3*. Note that consequently, all piecewise 
testable languages are FO^(<)-definable. It has been shown in [12] for Z\2(<), 
and in [21] for F0^(<) that these are exactly the languages whose syntactic 
monoid belongs to the decidable class DA. 

There is a common difficulty in the separation problems for these two classes. 
A priori, it is not known up to which level one should proceed in refining the 
candidate separators to be able to answer the question of separability. For piece- 
wise testable languages, this refinement basically means increasing the size of 
the considered subwords. For unambiguous languages this means increasing the 
size of the unambiguous products. For both of these classes, we are able to com- 
pute, from the two input languages, a number that suffices for this purpose. This 
entails decidability of the separability problem for both classes. 



A rough analysis yields a 3-Nexptime upper bound for separation by unam- 
biguous languages. For separability by piecewise testable languages, we obtain 
a better bound starting from NFAs: we show that two languages are separable 
if and only if the corresponding automata do not contain certain forbidden pat- 
terns of the same type, and we prove that the presence of such patterns can be 
decided in polynomial time wrt. the size of the automata and of the alphabet. 
This yields a Ptime algorithm for deciding separation by a piecewise testable 
language. 

Related work. The classes of piecewise testable and unambiguous languages are 
varieties of regular languages. For such varieties, there is a generic connection 
found by Almeida [1] between profinite semigroup theory and the separation 
problem: Almeida has shown that two regular languages over A are separable by 
a language of a variety A*V if and only if the topological closures of these two 
languages inside a profinite semigroup, depending only on A*V, intersect. Note 
that this theory does not give any information about how to actually construct 
the separator, in case two languages are separable. To turn Almeida's result into 
an algorithm deciding separability, we should compute representations of these 
topological closures, and test for emptiness of intersections of such closures. 

So far, these problems have no generic answer and have been studied in an 
algebraic context for a small number of specific varieties. Deciding whether the 
closures of two regular languages intersect is equivalent to computing the so- 
called 2-pointlike sets of a finite semigroup wrt. the considered variety, see [1]. 
This question has been answered positively for the varieties of finite group lan- 
guages |4I14) . piecewise testable languages |3I2) . star- free languages |10I9| . and 
a few other varieties, but it was left open for unambiguous languages. 

A general issue is that the topological closures may not be describable by 
a finite device. However, for piecewise testable languages, the approach of [3] 
builds an automaton over an extended alphabet, which recognizes the closure of 
the original language. This can be performed in polynomial time wrt. the size of 
the original automaton. Since these automata admit the usual construction for 
intersection and can be checked for emptiness in Nlogspace, this gives a poly- 
nomial time algorithm wrt. the size of the original automata. The construction 
was presented for deterministic automata but also works for nondeterministic 
ones. One should mention that the extended alphabet is 2^ (where A is the orig- 
inal alphabet). Therefore, these results give an algorithm which, from two NFAs, 
decides separability by piecewise testable languages in time polynomial in the 
number of states of the NFAs and exponential in the size of the original alphabet. 

Our proof for separability by piecewise testable languages follows the same 
pattern as the method described above, but a significant improvement is that 
we show that non-separability is witnessed by both automata admitting a path 
of the same shape. This allows us to present an algorithm that provides better 
complexity as it runs in polynomial time in both the size of the automata, and 
in the size of the alphabet. Also, we do not make use of the theory of profi- 
nite semigroups: we work only with elementary concepts. We have described 
this algorithm in [23] . Furthermore, we show how to compute from the input 



languages, an index that suffices to separate them. Recently, Martens et. al. [6] 
also provided a Ptime algorithm for deciding separability by piecewise testable 
languages, using different proofs but do not provide the computation of such an 
index. 

Finally, for separation by unambiguous languages, the positive decidability 
result of this paper is new, up to the authors' knowledge. It is equivalent to the 
decidability of computing the 2-pointlike sets for the class DA. 

2 Preliminaries 

We fix a finite alphabet A — {ai, . . . ,a,„}. We denote by A* the free monoid 
over A. The empty word is denoted by e. For a word u E A* , the smallest B C A 
such that u £ B* is called the alphabet of u and is denoted by alph(u). 
Separability. Given languages L, Li, L2, we say that L separates Li from L2 if 

Li C L and L2 n L = 0. 

Given a class Sep of languages, we say that the pair (Li, L2) is Sep-separable if 
some language L e Sep separates Li from ^2- Since all classes we consider are 
closed under complement, (ii,L2) is Sep-separable if and only if (L2,Li) is, in 
which case we simply say that Li and L2 are Sep-separable. 

We are interested in two classes Sep of separators: the class of piecewise 
testable languages, and the class of unambiguous languages. 
Piecewise Testable Languages. We say that a word m is a piece of v, if 

u = hi-- -bk, where 61, ... , bk G A, and v e A*biA* ■ ■ ■ A*bkA*. 

For instance, ab is a piece of bbaccba. The size of a piece is its number of 
letters. A language L C ^* is piecewise testable if there exists a k g N such that 
membership of w in L only depends on the pieces of size up to k occurring in w. 
We write w ^^^ w' when w and w' have the same pieces of size up to k. Clearly, 
~K is an equivalence relation, and it has finite index (since there are finitely 
many pieces of size k or less). Therefore, a language is piecewise testable if and 
only if it is a union of ^^-classes for some k g N. In this case, the language is said 
to be of index k. It is easy to see that a language is piecewise testable if and only 
if it is a finite boolean combination of languages of the form A*biA* ■ ■ ■ A*bkA* . 
Piecewise testable languages are those definable by !BZ'i(<) formulas. 23Z'i(< 
) formulas are boolean combinations of first-order formulas of the form: 

E3a;i . . . 3a;„ (^(xi, . . . , a;„), 

where f is quantifier-free. For instance, A*biA* ■ ■ ■ A*bkA* is defined by the for- 
mula Ela;i . . . 3xk [Ai<ki^i ^ Xi+i) A Ai<fe ^ii^i)] j where the first-order variables 
Xi,. . . ,Xk are interpreted as positions, and where b{x) is the predicate testing 
that position x carries letter b. 

We denote by PT[k] the class of all piecewise testable languages of index k 
or less, and by PT = \J^ PT[k] the class of all piecewise testable languages. 



Given L C A* and k e N, the smallest PT[K]-language containing L is 

[L]^K — {w E A* \ 3u £ L and u ^^ w}. 

In general however, there is no smallest PT-language containing a given language, 
because removing one word from a PT-language yields again a PT-language. 

Unambiguous Languages. A product L = B^aiBl ■ ■ ■ B^_^akB^ is called un- 
ambiguous if every word of L admits exactly one factorization witnessing its 
membership in L. The integer k is called the size of the product. An unambigu- 
ous language is a finite disjoint union of unambiguous products. Observe that 
unambiguous languages are connected to piecewise testable languages. Indeed, 
it was proved in [16 that the class of unambiguous languages is closed under 
boolean operations. Moreover, languages of the form A*biA* ■ ■ ■ A*bkA* are un- 
ambiguous, witnessed by the product {A\{bi})*bi{A\{b2})* ■ ■ ■ {A\{bk})*bkA* . 
Therefore, piecewise testable languages form a subclass of the class of unambigu- 
ous ones. 

Many equivalent characterizations for unambiguous languages have been 
found [20'. From a logical point of view, unambiguous languages are exactly 
the languages definable by an F0^(<) formula [21j. F0^(<) is the two- variable 
restriction of first-order logic. Another logical characterization which further 
illustrates the link with piecewise testable languages (i.e. !BZ'i(<)-definable lan- 
guages) is Z\2(<). A ^2(<) formula is a first-order formula of the form: 

32:1 ...3xn Vyi ...yy„i ip{xi,. . . ,Xn,yi,- ■ ■ ,ym), 

where tp is quantifier-free. A language is A2{<)- definable if it can be defined both 
by a 1^2 (<) formula and the negation of a -172 (<) formula. It has been proven 
in [12] that a language is unambiguous if and only if it is Z\2(<)-definable. 

For two words w, w' , we write, w =« w' if z«, w' satisfy the same unambiguous 
products of size k or less. We denote by UL[k\ the class of all languages that are 
unions of equivalence classes of =«, and we let UL — [J^ UL[k]. Observe that 
because unambiguous languages are closed under boolean operations, UL is the 
class of all unambiguous languages. 

Given L <Z A* and k £ N, the smallest UL[K]-language containing L is 

[i]sK ~ {w E A* \3u £ L and u ^^ w}. 

In general again, there is no smallest UL-language containing a given language. 

Automata. A nondeterministic finite automaton (NFA) over A is denoted by a 
tuple A = {Q,A,I,F,S), where Q is the set of states, I ^ Q the set of initial 
states, F C Q the set of final states and 6 C Q x A x Q the transition relation. 
The size of an automaton is its number of states plus its number of transitions. 
We denote by L{A) the language of words accepted by A. Given a word u £ A*, 
a subset B oi A and two states p, g of 71, we denote 

— by p > q a path from state p to state q labeled u. 

— by p -^ — > q a path from p to q oi which all transitions are labeled over B. 



— by p > q a path from p to g of which all transitions are labeled over B, 

with the additional demand that every letter of B occurs at least once along 
it. 
Given a state p, we denote by scc(p, A) the strongly connected component of p 
in A (that is, the set of states that are reachable from p and from which p can 
be reached), and by alph_scc(p,^) the set of labels of all transitions occurring 
in this strongly connected component. Finally, we define the restriction of A to 
a subalphabet BCAhyAls =* (Q, B,I,F,d HiQ x B x Q)). 
Monoids. Let i be a language and Af be a monoid. We say that L is recognized 
by M if there exists a monoid morphism a : A* ^ M together with a subset 
F C M such that L = a^^{F). It is well known that a language is accepted by 
an NFA if and only if it can be recognized by a finite monoid. Further, one can 
compute from any NFA a finite monoid recognizing its accepted language. 

3 Separation by piecewise testable languages 

Since PT[k] C PT, PT[k] -separability implies PT-separability. Further, for a 
fixed K, it is obviously decidable whether two languages Li and L2 are PT[k\- 
separable: there is a finite number of PT[k] languages over A, and for each of 
them, one can test whether it separates Li and ^2- The difficulty for decid- 
ing whether Li and L2 are PT-separable is to effectively compute a witness 
K = k(Li,L2)j ^-C-, such that Li and L2 are PT-separable if and only if they 
are PT[K]-separable. Actually, we show that PT-separability is decidable, by 
different arguments: 

(l.a) We give a necessary and sufficient condition on NFAs recognizing Li and 
L2, in terms of forbidden patterns, to test whether Li and T2 are PT- 
separable. 
(1.&) We give a polynomial time algorithm to check this condition. 
(2) We compute k e N from Li,L2, such that PT-separability and PT[k]- 
separability are equivalent for Li and ^2- Hence, if the Ptime algorithm an- 
swers that Li and T2 are PT-separable, then [Ti]^k is a valid PT-separator. 
Let us first introduce some terminology to explain the necessary and sufficient 
condition on NFAs. Let A be an NFA over A. For uq, . . . ,Up ^ A* and nonempty 
subalphabets Bi, ..., Bp C A, let u = {uq, . . . , Up) and B = {Bi, . . . , Bp). We 
call (it, B) a factorization pair. A {u, B)-path in ^l is a successful path (leading 
from the initial state to a final state of A), of the form shown in Fig. [TJ 



Bi =B, 
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-^Q^kD " ' CK^ — 0^0-0^0 -^ — -o^^ 

Fig. 1. A (u,B)-path 

Recall that edges denote sequences of transitions (see section Automata, p.[S|). 
Therefore, if A has a (m, B)-path, then L{A) contains a language of the form 
UQ{xiylzi)ui ■ ■ ■ Up-i{xpy*Zp)up, where alph(xi) U alph(zj) C a\ph{yi) = B^. 



Given NFAs Ai and A2, we say that a factorization pair {u, B) is a witness 
of non PT-separability for (Ai,A2) if there is a (m, B)-path in both Tli and A2. 
For instance, Tli and A2 pictured in Fig.[2]have a witness of non PT-separabihty, 
namely the factorization pair {u,B) with u = (e, c, e) and B = ({a, &},{a}). 



fe a 

Automaton yii Automaton yi2 

Fig. 2. A witness of non PT-separability for (Ai,A2): u — (e, c, e), i? — {{a, b}, {a}) 

We are now ready to state our main result regarding PT-separability. 

Theorem 1. Let A\ and A2 be two NFAs over A. Let Li — L{Ai) and L2 = 
L{A2)- Letki,k2 be the number of states of Ai resp. A2. Define p — max(A:i, ^2)-!- 
1 and K = p\A\2'^ I-4|(p|A|+i)^ Then the following conditions are equivalent: 

(1) Li and L2 are PT-separable. 

(2) Li and L2 are PT[k\- separable. 

(3) The language [Li]o^k separates Li from i2- 

(4) There is no witness of non PT-separability in {Ai,A2)- 

Condition |(2)| yields an algorithm to test PT-separability of regular languages. 
Indeed, one can effectively compute all piecewise testable languages of index k 
(of which there are finitely many), and for each of them, one can test whether it 
separates Li and L2- Before proving Theorem[Tl we show that Condition | (4) | can 
be tested in polynomial time (and hence, PT-separability is Ptime decidable). 

Proposition 2. Given two NFAs Ai andA2, one can determine luhether there 
exists a witness of non PT-separability in {Ai,A2) in polynomial time wrt. the 
sizes of Ai and A2, and the size of the alphabet. 

Proof. Let us first show that the following problem is in Ptime: given states 

Pi,gi,ri of yii and ^2,(72,^2 of 712, determine whether there exists a nonempty 

c n i^B^ c R 

B C A and paths pi ~ — > qi > qi -^ — > r^ in Ai for both 1 = 1,2. 

To do so, we compute a decreasing sequence {Ci)i of alphabets overapproxi- 

mating the greatest alphabet B that can be chosen for labeling the loops. Note 

that if there exists such an alphabet -B, it should be contained in 

Ci = alph_scc(qi,yii) n alph_scc(g2,-A2). 

Using Tarjan's algorithm to compute strongly connected components in linear 
time, one can compute Ci in linear time as well. Then, we restrict the automata 
to alphabet Ci, and we repeat the process to obtain the sequence {Ci)i: 

Ci+i = alph_scc(qi,yii |'Ci)nalph_scc(g2,^2 tcj- 



After a finite number n of iterations, we obtain C„ = Cn+i- Note tliat n ^ 
|alph(yii) n alph(yi2)| ^ \A\. If C„ ~ 0, then there exists no nonempty B for 
which there is an {— B)-looj> around both pi and p2- If Cn ^ 0, then it is 
the maximal nonempty alphabet B such that there are (= i?)-loops around gi 
in Ai and 52 in ^2- It then remains to determine whether there exist paths 
Pi -^ — > qi -^ — > ri and p2 — — > q2 ^ — > f2, which can be performed in linear time. 

To sum up, since the number n of iterations to get Cn = C'n+i is bounded 
by \A\, and since each computation is linear wrt. the size of Ai and A2, one 
can decide in Ptime wrt. to both \A\ and these sizes whether such pair of 
paths occurs. 

Now we build from Ai and ^12 two new automata Ai and A2 as follows. The 
procedure first initializes Ai as a copy of ^i. Denote by Qi the state set of ^i. 
For each 4-uple r — {pi,ri,p2,r2) 6 Qf x Q^ such that there exist B ^ 0, two 

states qi G Qi,q2 G Q2 and paths pi — — > qi ^— ?> qi — — > ri both for i = 1 and 
i = 2, we add in both Ai and A2 a new letter a,- to the alphabet, and "summary" 
transitions pi —^ ri and p2 — ^ ^2 . Since there is a polynomial number of tuples 
(j)i,qi,ri,p2,q2,r2), the above shows that computing these new transitions can 
be performed in Ptime. So, computing Ai and ^12 can be done in Ptime. 

By construction, there exists some factorization pair (m, B) such that Ai and 
A2 both have a (m, S)-path if and only if L{Ai) fl L{A2) ^ 0- Since both Ai 
and A2 can be built in Ptime, this can be decided in polynomial time as well. 

D 

The following is an immediate consequence of Theorem [1] and Proposition [2l 



Corollary 3. Given two NFAs, one can determine in polynomial time, with 
respect to the number of states and the size of the alphabet, whether the languages 
recognized by these NFAs are PT-separable. D 

In the rest of the section, we sketch the proof of Theorem [1] The implications 



(3)^=^(2) ^^ (1) are obvious. To show (1) =4> |(2)[ we introduce some termi- 
nology. Let us fix an arbitrary order oi < • • • < Om on A. 

{p, i3)-patterns. Let B = {61, . . . , &r} ^ A with bi < ■■■ <br, and let p G N. 
We say that a word u; G A* is a (p, B)-pattern iiw & {B*biB* ■ ■ ■ B*brB*)P. The 
number p is called the power of w. For example, set B = {a, 6, c} with a < b < c. 
The word bbaababccacbabaca is a (2, _B)-pattern but not a (3, i?)-pattern. 

^-templates. An i-template is a sequence of length £, T — ti, . . . ,ti, such that 
every ti is either a letter a or a subset B of the alphabet A. The main idea behind 
^-templates is that they yield decompositions of words that can be detected using 
pieces and provide a suitable decomposition for pumping. Unfortunately, not all 
^-templates are actually detectable. Because of this we restrict ourselves to a 
special case of ^-templates. An ^-template is said to be unambiguous if all pairs 
ti,ti+i are either two letters, two incomparable sets or a set and a letter that is 
not included in the set. For example, T = a, {&, c}, d, {a} is unambiguous, while 
T' = b,{b,c},d,{a} and T" ^ a,{b,c},{c},{a} are not. 



p-implementations. A word w G ^* is a p-implementation of an ^-template 
T = ti,...,ti ii w = wi'-'We and for all i either ti = Wi (z A or ti = 
B C A, Wi £ B* and Wi is a (p, _B)-pattern. For example, abccbbcbdaaaa = 
a.(bccbbcb) .d.( aa aa) is a 2-iniplementation of the 4-template T — a, {b, c}, d, {a}, 
since bccbbcb is a (2, {6, c})-pattern and aaaa is a (2, {a})-pattern. 



We now use p- implementations to prove (1) => |(2)[ The proof is divided 
in two steps. First, we prove that there exists p such that if two words are p- 
implementations of the same ^-template for some £, then they can be pumped into 
words containing the same pieces of size k for any fc, while keeping membership 
in the regular languages. We will then prove that if two words contain the same 
pieces for a large enough size, they are both p-implementations of a common 
unambiguous ^-template. We begin with the first step in the following lemma. 

Lemma 4. Let wi G Li and W2 G ^2- From Li, L2, we can compute p G N such 
that whenever wi, W2 are both p-implementations of an i-template T for some £, 
then for every k £N, there exist w'l G Li and w'2 G L2 such that w'l ~k w'2- 

Proof. This is a pumping argument. Let k\,k2 be the number of states of au- 
tomata recognizing Li and L2 and set p = max(A;i,fc2). Set wi,W2 and T = 
ti, . . . ,te as in the statement of the lemma. Fix k G N. Whenever ti is a set _B, 
the corresponding factors in wi,W2 are {p, i?)-patterns. By choice of p, it follows 
from a pumping argument that these factors can be pumped into {k, i3)-patterns 
in Li and L2. It is then easy to check that the resulting words have the same 
pieces of size k. D 

We now move to our second step. We prove that there exists a number k 
such that two words having the same pieces of size up to k must both be p- 
implementations of a common unambiguous ^-template (where p is the number 
introduced in Lemma S]). Again, we split the proof in two parts. We begin by 
proving that it is enough to look for ^-templates for a bounded £. 

Lemma 5. Letp G N. Every word is the p-implementation of some unambiguous 
NA-template, for Na = 22''"I'4|(pI^I+i). 

Proof. We first get rid of the unambiguity condition. Any ambiguous ^-template 
T can be reduced to an unambiguous ^'-template T' with ^' < ^ by merging the 
ambiguities. It is then straightforward to reduce any p-implementation of T into 
a p-implementation of T' . Therefore, it suffices to prove that every word is the 
p-implementation of some (possibly ambiguous) A'yi-template. 

The choice of Na comes from Erdos-Szekeres' upper bound of Ramsey num- 
bers. Indeed, for this value of Na, from every complete graph of size Na with 
edges labeled over 2'"^' colors, there must be some complete monochromatic sub- 
graph of size p\A\ + 1 (see [5 for a short proof that this bound suffices). 

Observe that a word is always the p-implementation of the ^-template which 
is just the sequence of its letters. Therefore, in order to complete our proof, it 
suffices to prove that if a word is the p-implementation of some ^-template T 
with £ > Na, then it is also the p-implementation of an ^'-template with £' < £. 
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Fix a word w, and assume that w is the p-implementation of some ^-template 
T = ti, . . . ,te with £ > Na- By definition, we get a decomposition w = wi ■ ■ ■ Wi. 
We construct a complete graph F with vertices {0, ■■■,£} and edges labeled by 
subsets of A. For all i < j, we set a\ph{wi+i ■ ■ -Wj) as the label of the edge 
(j,j). Since F has more than £ > Na vertices, by definition Na there exists 
a complete monochromatic subgraph with p\A\ + 1 vertices {ii, ■ ■ ■ ,ip\A\+i}- 
Let B be the color of the edges of this monochromatic subgraph. Let w' ~ 
Wi-^+i ■ ■ ■ Wi lAt^i , which is the concatenation of the p\A\ words, Wi-+i ■ ■ ■ Wi-^-^ , 
for j < p\A\. By construction, these words have alphabet exactly B, hence w' 
is a (p, i?)-pattern. It follows that w is a p-implementation of the ^'-template 
ti, . . . ,ti^,B,ti^^^^^2, . . . ,te with £' — I — p\A\ + 1. Hence £' < £ (except for the 
trivial case p =\A\ — 1). D 

The next lemma shown in App. A proves that once £ and p are fixed, given w it is 
possible to describe by pieces the unambiguous ^-templates that w p-implements. 

Lemma 6. Leti,p G N. Fromp and I, we can compute k such that for every pair 
of words w ~K w' and every unambiguous £-template T , w' is a p-implementation 
of T whenever w is a [p + 1) -implementation of T . 



We finish the proof of the implication (1) => |(2)| by assembling the results. 
Assume that Li is recognized by an NFA with ki states. Let p — max(fci,fc2) 
be as introduced in Lemma SI Na as introduced in Lemma [5] for p + 1, and 
K = \A\{p + 1)Na be as introduced in Lemma IHl Fix k' > k and assume that we 
have wi e Li and W2 G ^2 such that wi ^^ W2- By LemmajSJ wi is the {p + 1)- 
implementation of some unambiguous A^yi-template T. Moreover, it follows from 
Lemma ini that W2 is a p-implementation of T. By Lemma 31 we finally get that 
G Li and Wn £ -^2 such that w', ~k' wL 



The implication (1) =>|(4)|of Theorem [T] is easy and shown by contraposi- 



tion, see j23i Lemma 2] and The remaining implication 1(4)1 ^=> (1) can be shown 
using Lemma |6] (see Appendix). For a direct proof, see J23[ Lemma 3], where 
the key for getting a forbidden pattern out of two non-separable languages is to 
extract a suitable p-implementation using Simon's factorization forests |18j . 



4 Separation by unambiguous languages 

This section is devoted to proving that UL-separability is a decidable property. 
We use an argument that is analogous to property |(2)| of Theorem [T] in Section [3l 
We prove that if ii, £2 are languages, it is possible to compute a number k such 
that Li,L2 are UL-separable iff they are m[K]-separable. It is then possible 
to test separability by using a brute-force approach that tests all languages in 
UL[k]. 

In this case, we were not able to prove equivalence between UL-separability 
and the existence of some common witness inside the automata of both input 
languages. Because of this, we have a much higher complexity. A rough analysis 
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of the problem, which can be found in the Appendix, gives an algorithm that runs 
in nondeterministic 3-exponential time in k. It is likely that this can be improved. 
We now state the main theorem of this section. 

Theorem 7. Let Mi and M2 be monoids recognizing Li,L2 C A*. Let k = 
{2\Mi\\M2\ + 1)(|^| + 1)^. Then the following conditions are equivalent: 

(1) Li and L2 are UL-separable. 

(2) Li and L2 are UL[k\- separable. 

(3) The language [ii]sK separates Li from L2. 



As in the previous section, Condition |(2)| yields an algorithm for testing 
whether two languages are separable. Indeed, the algorithm simply computes all 
languages in UL[k] and checks for each of them whether it is a separator. 

Corollary 8. It is decidable whether two regular languages can be separated by 
a unambiguous language. 

Observe that in contrast to Theorem[Tl the bound k of Theorem[7]is stated in 
terms of monoids rather than in terms of automata which means an exponential 
blow-up. This is necessary for our proof technique to work. 

Another remark, is that by definition of UL[k], the bound k is defined in terms 
of unambiguous products. A rephrasing of the theorem would be: there exists a 
separator iff there exists one defined by a boolean combination of unambiguous 
products of size k. It turns out that k also works for F0^(<), i.e., there exists a 
separator iff there exists one defined by an FO^(<)-formula of quantifier rank k. 
This can be proved by making minor adjustements to the proof of Theorem [T] 

The proof of Theorem [7] is inspired from techniques used in [T3] and relies 
heavily on the notion of (i?, p)-patterns. The full proof, which works by induction 
on the size of the alphabet, can be found in the Appendix. We actually prove 
a proposition that is basically a rephrasing of Theorem [7] as a pumping-like 
property. In the remainder of this section, we state this proposition and explain 
why it implies Theorem [71 

Fix two regular languages Li,L2 over A, as well as their syntactic monoids 
Ml, M2, and the corresponding morphisms ai, a2. 

Proposition 9. Let k — (2|Mi||A/2| + 1)(|A| + 1)^. For all pairs of words wi =« 
W2 and all k! > n, there exists w'l =«' w'2 such that ai{wi) — ai{w[) and 
a2{w2) = a2{w'2). 

Let us bricfiy explain why Proposition [5] implies Theorem [7] We explain why 
the direction [(1)] =^ [(3)] holds, since [(3)] => [(2)] and [(2)] =^ [(1)] are trivial. 



We prove the contrapositive: assume that [XiJsk is not a separator. Hence, by 
definition of [Li]sk, there exist wi G Li and W2 G L2 such that wi =k, W2- Now, 
let k' > K. From Proposition [SI it follows that there exist w[ G Li and W2 G L2 
such that w[ =«' W2. This means that Li and L2 cannot be separated by a 
language of UL[k']. Since this holds for all k' > k, it follows that Li and L2 are 
not [7L-separable. 
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5 Conclusion 

We proved separation results for both piecewise testable and unambiguous lan- 
guages. Both results provide a way to decide separability. In the PT case, we 
even prove that this can be done in Ptime. Moreover, in both cases we give an 
insight on the actual separator by providing a bound on its size should it exist. 

There remain several interesting questions in this field. First, one could con- 
sider other subclasses of regular languages, the most interesting one being full 
first-order logic. Separability by first-order logic has already been proven to be 
decidable using semigroup theory ^. However, this approach is difficult to un- 
derstand, it yields a costly algorithm, it only provides a yes/no answer, and no 
insight about a possible separator. Another question is to get tight complexity 
bounds. For unambiguous languages for instance, one can show a SNexptime 
bound using translation to automata, but in this case, and even for piecewise 
testable languages, we do not know any tight bounds on the size of separators. 

Another observation is that right now, we have no general approach and are 
bound to use ad-hoc techniques for each subclass. An interesting direction would 
be to invent a general framework that is suitable for this problem in the same 
way that monoids are a suitable framework for decidable characterizations. 
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Appendix 

A: Proofs of Section [3] 

Proof of Condition |(4)| Theorem [1] 

We prove Condition [(4)] We prove [(4)] =^ [(T)] and p)] =^ [(4)| 



(4)1 =^ (1), We proceed by contraposition. Assume that Li,L2 are not PT- 
separable. Recall that Ai and A2 are NFAs for Li,L2 and that ki,k2 are their 
sizes. Set p = max(fci,fc2) + 1 and £ = 22''*'I'4|(pI'4|+i)_ We prove that Li,L2 
both contain ^-implementations of some A^^-template T and use T to construct 
a witness of non PT-separability in {Ai,A2)- 

Let K be as defined in Lemma [5] from i and p. Because Li,L2 are not PT- 
separable, there exist wi 6 Li and u;2 G L2 such that wi '--'k W2- By choice 
of £, p and Lemma El wi must be the p- implementation of some unambiguous 
^-template T. Applying Lemma|6l we obtain that wi, W2 are both (max(fci, ^2))- 
implementations of T. 

Let B = {Bi, . . . , Bn) be the subsequence of elements T that are sets. Let 
u = (uq, .. .,Un), where Ui is the word obtained by concatenating the letters 
that are between Bi and Bi+i in T. By definition (m, B) is a factorization pair. 

Because wi is a (max(fci, fc2))-implementation, the path used to read wi in Ai 
must traverse loops labeled by each of the Bi, and clearly this is a {u, B)-path. 
Using the same argument we get that the path of W2 in A2 is also a {u, B)-path. 
Therefore (m, B) is a witness of non PT-separability. 



1(3)1 =^ |(4)[ Again, we proceed by contraposition. Set k as in Theorem [T] and 
assume that there exists a factorization pair (u, B) that is a witness of non 
PT-separabilty. We prove that [-Li]^k is not a separator. 

Set B — (Bi, . . . , Bn) and m = (wq, ■ ■ ■ ,Un)- By definition, this means that 
there exists wi G Li and W2 G ^2 of the form 

UoViUiVi ■ ■ -VnUn, 

where the words such that alph(wi) = Bi and Vi contains a {Bi, n) — pattern. It 
is straightforward to see that wi ^^ W2- Therefore, W2 G iv2 H [Li]^k and [ii]^K 
is not a separator. 

Proof of Lemma [6] 

Lemma 6. Leti,p G N. Fromp andi, we can compute k such that for every pair 
of words w ~K w' and every unambiguous (.-template T , w' is a p-implementation 
of T whenever w is a (p + 1) -implementation of T . 

Proof. We prove that the lemma holds for k — \A\pi. Let w ^^ w' and let 
T = {ti,t2, ■ . ■ ,ti} be an unambiguous ^-template such that it; is a p -I- 1- 
implementation of T. We begin by giving a decomposition of w' and prove 
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that it indeed witnesses the fact that w' is a p-implementation of T. We de- 
fine w'l- ■ -w'l = w' inductively as follows: assume that the factors are defined up 
to w'^ and let u be such that w' = wi ■ ■ ■ w[ ■ u. If t^+i is a letter then ui^+i is 
just the first letter of u, otherwise i^+i — B <~ A^ and in that case w^+i is the 
largest prefix of u which contains only letters of B. We will prove that w'^- ■ ■ w'^ 
witnesses that w' is a p-implementation of T. The proof relies on a subresult 
that we state and prove below. 

To every unambiguous i'-template T — {ii, t2, ■ ■ ■ , ti} and p S N, we associate 
a piece VT,p — vi ■ ■ ■ vi, such that for all i: 

a a ti = a £ A 

{bi---bn)PiiU = {bi,...,bn}CA 

By definition, if w is a p-implementation of T then VT,p is a piece of w. Consider 
vt,i = Vi ■ ■ ■ Vi. a piece v is incompatible with T when v is of the following form: 
V = vi ■ ■ -Vi-u- Vi+i ■ ■ -Vi such that if ti (resp. ti+i) is a set the first letter (resp. 
last letter) of w is not in ti (resp. i^+i). 

Claim 1 If w is a 1 -implementation of som.e unamhiguous (.-template T, then 
there is no piece of w that is incompatible with T. 

Proof. This is a consequence of the fact that T is unambiguous. D 

We now prove that for all i, w^ —a if ti is the letter a or w[ is a word over some 
B <^ A containing a (p, i3)-pattern if ti — B. We proceed by induction, assume 
that this is true up to w'i and consider w'ij^-^. Set vt,i — vi • ■ -vg and VT,p+i = 
vi ■ ■ ■ vi. By induction hypothesis and by choice of k, we know that w^+i ■ ■ ■ Vi 
and Vi+i ■ • -vi are pieces of Wi+i • ■ • wi. We distinguish two cases depending on 
the nature of i^+i. 

Case 1: t^+i is some letter a. We have to prove that w'ij^i = a. Assume that 
w'i^i = b ^ a. Then vi ■ ■ ■ Vi ■ b ■ Vi^i ■ ■ -vg must be a piece of w' and therefore 
a piece of w {w ^^ w'). We prove that this piece is incompatible with T, which 
contradicts Claim [TJ Observe that by definition of Wi, if ti is a set then b ^ ti 
(otherwise it would have been included in ti). Therefore vi ■ ■ ■ Vi • b • Vi+i • • • w^ is 
incompatible with T and we are done for this case. 

Case 2: ti+i is a set B = {6i, . . . , &„}. By construction, Wi+i contains only letters 
in B. Therefore, we have to prove that it contains a (p, i3)-pattern. Assume that it 
does not. By contruction, the first letter of Wi+2 is some letter c ^ B. If w^+i = e, 
then vi ■ ■ ■ Vi ■ c ■ w^+i ■ ■ ■ ve must be a piece of w' . Using a similar argument to 
the one of the previous case, we can prove that this piece is incompatible with 
T, which contradicts Claim [TJ 

Otherwise, let b be the first letter of w^+i. Recall that by definition, r^+i con- 
tains a (p + 1, i3)-pattern and that r^+i • ■ • r^ is a piece of Wi+i ■ ■ ■ wg. Therefore, 
since lUi+i does not contain a (p, i?)-pattern, the last suffix of r^+i containing 
a (1, i?)-pattern must fall in Wi-^-2 and consequently, Vi^i ■ ■ ■ vg must be a piece 
of Wi+2 ■ • -wg. It follows that vi ■ ■ -Vi ■ b ■ c ■ u^+i ■ ■ -Vi is a piece of w' and of 
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w (w ~k; w'). By definition, c ^ ti+i. Moreover, by construction, if U is a set, 
then b ^ ti. Tlierefore, vi ■ ■ ■ Vi ■ b ■ c ■ u^+i • • ■ W£ is incompatible with T which 
contradicts Claim [1] since it is also a piece of w. D 

B: Proofs of Section [4] 

Complexity Analysis 

We briefly explain how UL-separability can be checked in SNexptime. This is 
a very rough analysis and it is likely that this can be improved. We use the 
connection between UL and F0^(<) to prove that any language in UL[k] is 
recognized by a deterministic automaton that is 3-exponential in k. In order to 
check separability it is then enough to non-deterministically guess an automaton 
of size 3-exponential in k, check if the automata recognizes an unambiguous 
language and check if it is a separator. This can be done in non-deterministic 
3-exponential time. 

It is straightforward to see that any unambiguous product can be described 
by an FO (<) formula whose nesting depth of quantifiers is polynomial in the 
size K of the product. One can then construct an equivalent unary temporal 
logic (UTL) formula that is exponential in size 7[. From this UTL formula, 
it is then possible to construct an equivalent deterministic automaton that is 
double exponential in size [8 . This automaton is 3-exponential in n. 

Proof of Theorem [7] 

We prove Proposition 1^1 As we explained, our proof relies heavily on the notion 
of (i?,p)-patterns. However, in this case we will only need to use (i?,p)-patterns 
where B is the whole alphabet A. For this reason, we simply write p-pattern 
for (j4,p)-pattern. We begin by giving a brief outline of the proof. We fix a 
large enough I, intuitively I needs to be large enough so that we can apply our 
pumping arguments to words of length I. Then, we show that k is large enough 
in order to ensure that wi, W2 are both a 2Z-pattern or neither of them are. In 
both cases, we are then able to decompose wi, W2 into sequences of factors that 
are pairwise equivalent and use a smaller alphabet. We then apply induction on 
these factors. In the second case, it then suffices to reconcatenate the factors to 
obtain the desired result. In the first case, a pumping argument depending on 
/ will also be needed. Before providing the lemmas involved in the construction, 
we first give some additional definitions concerning fc-patterns that will come in 
conveniently. 

Decompositions for fc-patterns. Recall that by definition, a word w is a fc- 
pattern iff u; G {A*ai ■ ■ ■ A*a„A*)'^. This means that w can be decomposed into 
a sequence of factors witnessing its membership in {A*ai ■ ■ ■ A*a„A*)'^: 



w = 

i=l 



_[ J_(Wi • fli mod n) ■ Wkn+1- 
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To each word w we associate a unique such decomposition for the largest integer 

k such that w is a fc-pattern. Let w G A* , we say that w admits a k- decomposition 

iff w is a fc-pattern but not a fc + 1-pattern. It is straightforward to see that if 

w admits a fc-decomposition then there exists an integer / such that kn < I < 

(fc + l)n, and 

I 

W = JJ(w;i -Oi mod m) • W/+1, (1) 

1=1 
such that for all i, a^ mod m ^ Wi- The integer I is called the length of the 
decomposition. We give an example of a word that admits a 1-decomposition (of 
length 5) in Figure ^. 



bcacbbcccaccbaa 

W\'W2'W-J,W4, Ws VJQ 



Fig. 3. w = bcacbbcccaccbaa over A — {a, b, c} admits a 1-decomposition 



We now state two lemmas that are needed for our construction. Both lem- 
mas link fc-decompositions to unambiguous languages. The first one states that 
fc-decompositions can be detected using an unambiguous product with large 
enough size. Moreover, using products of the same size, one can also describe 
the products that are satisfied by the factors in the fc-decomposition. 

Lemma 7. Let fc, k G N such that k > k -\- k{\A\ -\- 1). Then for every pair of 
words u ^K w and all h i^ k, u admits an h- decomposition iff v admits an h- 
decomposition. Moreover, the associated decompositions, as described in ^, are 
of the same length I: 

U = Yl,= i{Ui ■ tti mod m) ■ Ui + i 
V = Yl.^iiVi ■ Oi mod m) ' Vl + 1 

and for all i, Ui ^^ Vi. 

Proof. For the sake of readability, for all a G ^ we will write Aa for the alphabet 
A\{a} and for all number i, we will write bi for Oi mod m- Assume that u admits 
an /i-decomposition and consider the associated decomposition: 

u = ]^(ui • bi) -ui+i. 
Consider the following unambiguous product: 



P = ^(.A^rb^)■A^„^,. 
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By construction, P is of size I ^ k{\A\ + 1) < k and u ^ P. Therefore, u e P 
and V admits an /i-decomposition together with the associated decomposition: 



V = Y\_('"^ ■ ^i) ■'"i+i- 



It remains to prove that for ah i, Ui =„ fi- Assume that Ui belongs to some 
unambiguous product P' of size k. This means that u is in the unambiguous 
product: 

Because P" is of size at most k + k{\A\ + l), we have v G P". By assumption 
on the decomposition of v, this means that w,; G P' and we arc done. D 

Our second lemma states that if a word has both a prefix and a suffix that are 
large enough fc-patterns, then the middle part has no influence on membership 
in an unambiguous product. 

Lemma 8. Let k' G N, and let u, v be two words that are both k' -patterns. Then 
for all words wi,W2, the following equivalence holds: 

U • Wi ■ V =k' U ■ W2 ■ v. 

Proof. We begin by observing a simple property of unambiguous products. 

Remark 1 Let BqOiBI ■ ■ ■ B*^,_^afiiB*^i be an unambiguous product. There can 
be at most one set Bi such that Bi = A. 

Let P — BqOiBI- ■ ■ B*^,_^aK,iB*^, an unambiguous product of size k' and 
assume that uwiv G P. We prove that UW2V G P. Since uwxv G P, there exists 
some decomposition 

uwiv = xoaia;i • • • Xk'-iOk'Xk' 

that is a witness. If no set Bi is the whole alphabet A, then uwiv is at most a k'- 
pattern, which is impossible since it is by definition a 2K'-pattern. Therefore, by 
Remark [1] there is exactly one set Bi such that Bi — A. It follows that the words 
Xoaixi ■ ■ • Xi-i and a^+i • • ■ x^i-ia^'X^i are at most k' — 1-patterns. Therefore, 
they are respectively a prefix of u and a suffix of v (which are ^'-patterns). It 
follows that there exists some word j/i such that 

UW2V = XoaiXi • • • Xi-iy^ai+i ■ ■ ■ x^^'^ia^'X^' . 

Such a decomposition for UW2V is witness for membership in P. We conclude 
that UW2V G P. 

D 
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We can now finish our construction by using Lemma [7] and Lemma [8] and 
prove Proposition [9l The proof goes by induction on the size of the alphabet. 

We begin by fixing the size of the patterns we are going to look for in wi^'W2- 
Set k = |Mi||Af2|. A pigeonhole principle argument proves that for sq, . . . , s/c G 
Ml and tq, . . . , r^ G A/2, there exist i < j such that both sq ■ ■ ■ Si„i — sq ■ ■ ■ Sj 
and ro • • • r^-i = rg ■ ■ ■ rj. We will look for fc-patterns. 

Observe that k = {2k + l)(m + 1)^ (recall that m = \A\). We prove Propo- 
sition |9] by induction on m. Let k' > k, wi =k W2 and set k = (2k + Vjvn? . By 
Lemma [71 either both wi,W2 admit /i-decompositions for some h < 2k or both 
wi, W2 do not admit /i-decomposition for h ^ 21. We treat these cases separately. 

Case 1: both wi and W2 admit /i-decompositions for h < 2k. Observe that 
k — {2k + l)m? implies that n > k + 2k{m + 1). Therefore, we can apply the 
second part of Lemma [7] to wi and W2- This yields the following decompositions: 

Wl = Yli^i{ui ■ flj mod m) • Ul+1 
^2 = rii^lC^i • Oi mod m) ' Wj+i 

such that for all i, Ui =« Vi. Also observe that by definition of these decompo- 
sitions Ui^Vi use a strict subalphabet of A. Therefore, the induction hypothesis 
can be used and for all i we get words u[, v[ such that u'^ =«/ v[, ai{ui) — ai{u'j) 
and a2{vi) — a2{v[). Now, consider the words: 

^1 ^ llj;=lV^i ■ ^i mod m,) ' "^l-^-l 
W2 — [[i^i{'Vi ■ di mod m) • "^l + i 

By construction, we have w'l =„/ w'2, ai{wi) = ai{w'i) and a2{w2) — 02(^2 ) 
and we are done. 

Case 2: both wi and W2 do not admit /i-decompositions for h ^ 2k. This 
means that wi,W2 are both 2fc-patterns. A simple argument as in the proof of 
Lemma [7] shows that wi,W2 can be decomposed into three factors: 

Wl — ui ■ u^ ■ u^ 

W2 — Vi ■ Vc ■ Wr 
such that U[ =^_k(m+l) V[, Ut =f^-kl^rn+l) Vt, U^ =f^_k(^m+i) Vc and U[, V(, Wr, Vt 

admit fc-decompositions. We prove that it is possible to construct u[,v[,u[,v'^ 
such that: 

(1) u[ =K.> v[ and uj. =«' v'^. 

(2) ai{ui) = ai{u[), ai(ur) = ai{u'^), a2(wi) = a2{v[) and a2(wr) = a2{v'^). 

(3) u[,v[,u[,v[. are all re'-patterns. 

We only give the proof for u[, v[, as the proof for u[, v'^ is identical. Observe 
that K — k{m -I- 1) > k -I- k{m + 1). Therefore, we can apply Lemma[7]to ui and 
f[. This yields the following decompositions: 

Ul = Yli^i{Ui ■ tti mod m) ■ Ul+i 
Vl = rii^lC^i ■ Oi mod m) ■ Vl + 1 
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such that for aU i, Ui =« Vi- Moreover the length I of the decompositions is 
I > km. Also observe that by definition of these decompositions, Ui,Ui use a 
strict subalphabet of A. Therefore, the induction hypothesis can be used and for 
all i we get words u'^,v'^ such that u^ =k' v'l, ai{ui) — ai(u^) and a2{vi) — a2{v'^)- 
We now use the fact that our choice of k allows us to pump the sequences of 
factors into large sequences: there exist numbers ^ j'l < J2 ^ k such that 

riiiV " "ll""! • "i "lod m) = nS "l("i • ^i mod m) 
Hill <^2{v[ ■ tti mod m) = ]\f=l OLl{u'^ ■ O^ rnod m) 



Now set, 



nj2« / 

i=(ji-l)n+l ^i ■ '^i mod m 

62 = n: 



J2n 



V: ■ a,. 



i=(ji-l)n+l ""i ""* mod m 

lli=l(^i ' ^* mod m) ' (eij • [[i^J2n+l\''^i ' ^* mod m) ' ^l + l 
«1 = Ul=liv'i ■ fli mod m) ■ (£2)^' ■ U\=j2n+li^i ' «> ™orf ™) ' «/+l 

Observe that by construction u[ ^k' vJ (they are products of pairwise equiv- 
alent factors). Moreover, by construction of ei, 62, we have ai(u[) = ai{u[) and 
ct2{vi) = ai{v[). Finally, since 61,62 are 1-patterns, u[,v[ are K'-patterns. Us- 
ing the same construction for Mtjt'r) we obtain u[,v'^. We can now finish our 
construction by giving w'i,w'2: 



By construction it is clear that ai{wi) = ai{w[) and a2(w2) = a2(w2). 
Moreover, because u\^u[. are K'-patterns it follows from Lemma [8] that 

u[ ■ Uc ■ u[ =„/ u[ ■ Vc ■ u'^ 
Finally, since u[ =«;' v[ and u'^ =«' v[., we have 

u[ -Vc-u'^ =K' v[-Vc- v'^ 
Combining the two equivalences we obtain w'l =k.' w'2 ■, which concludes the proof. 



